 paraphrase generation


Paraphrase Generation with Latent Bag of Words

Neural Information Processing Systems

Paraphrase generation is a long-standing and important problem in natural language processing. Recent progress in deep generative models has shown promising results on discrete latent variables for text generation. Inspired by variational autoencoders with discrete latent structures, in this work we propose a latent bag of words (BOW) model for paraphrase generation. We ground the semantics of a discrete latent variable in the target BOW. We use this latent variable to build a fully differentiable content planning and surface realization pipeline. Specifically, we use source words to predict their neighbors and model the target BOW with a mixture of softmaxes.
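The mixture-of-softmaxes step can be illustrated with a minimal numpy sketch; the shapes, the single projection matrix `W`, and the uniform mixture weights are illustrative assumptions, not the paper's exact parameterization:

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    z = x - x.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def latent_bow_mixture(source_vecs, W):
    # Each source word vector predicts a softmax over the vocabulary
    # (its "neighbors"); the target BOW estimate is their average,
    # i.e. a uniform mixture of softmaxes.
    per_word = softmax(source_vecs @ W)   # (n_words, vocab_size)
    return per_word.mean(axis=0)          # (vocab_size,)

rng = np.random.default_rng(0)
bow = latent_bow_mixture(rng.normal(size=(3, 8)), rng.normal(size=(8, 50)))
```

Because each component is a valid distribution, the mixture is too, so it can be treated directly as a distribution over target bag-of-words tokens.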


Reviews: Paraphrase Generation with Latent Bag of Words

Neural Information Processing Systems

This paper presents a model in which a latent bag of words informs a paraphrase generation model. For each source word, the authors compute a multinomial distribution over "neighbor" vocabulary words; a mixture of softmaxes over these neighbors then yields a bag-of-words distribution. In the generative process, a set of words is drawn from this distribution and their word embeddings are averaged to form input to the decoder. During training, the authors use a continuous relaxation of this procedure with Gumbel top-k sampling (a differentiable way to sample k of these words without replacement). The sampled words' embeddings are averaged and fed into the LSTM's initial state.
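The Gumbel top-k step described here can be sketched as follows; `gumbel_topk` and `bow_decoder_init` are hypothetical helper names, and the hard `argsort` shown is the non-relaxed version of the sampling (the paper's training-time relaxation is continuous):

```python
import numpy as np

def gumbel_topk(log_probs, k, rng):
    # Gumbel-top-k trick: perturbing log-probabilities with Gumbel(0, 1)
    # noise and keeping the k largest indices samples k distinct words
    # without replacement from the categorical distribution.
    g = rng.gumbel(size=log_probs.shape)
    return np.argsort(log_probs + g)[::-1][:k]

def bow_decoder_init(word_embs, log_probs, k, rng):
    # Average the embeddings of the k sampled words, e.g. to feed the
    # decoder LSTM's initial state (hard sampling shown; training uses
    # a continuous relaxation of this step).
    idx = gumbel_topk(log_probs, k, rng)
    return word_embs[idx].mean(axis=0)

rng = np.random.default_rng(0)
logp = np.log(np.full(10, 0.1))        # uniform toy distribution
idx = gumbel_topk(logp, 3, rng)
init = bow_decoder_init(np.eye(10), logp, 3, rng)
```

The averaging makes the decoder input a smooth function of the sampled words' embeddings, which is what allows gradients to flow once the hard top-k is relaxed.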


Reviews: Paraphrase Generation with Latent Bag of Words

Neural Information Processing Systems

The paper proposes a two-stage model for sentence-level paraphrase generation, trained end-to-end. The first stage is content planning (specifically, predicting a 'latent' bag of keywords). The second is the surface realization stage (forming a sentence conditioned on the keywords). The model is interesting and novel. The evaluation is sufficiently convincing (the author response, I believe, addressed Reviewer 1's initial concerns).


Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation

Jayawardena, Lasal, Yapa, Prasan

arXiv.org Artificial Intelligence

Over the past year, the field of Natural Language Generation (NLG) has experienced an exponential surge, largely due to the introduction of Large Language Models (LLMs). These models have exhibited highly effective performance across a range of Natural Language Processing and Generation tasks. However, their application to domain-specific tasks, such as paraphrasing, presents significant challenges. Their extensive number of parameters makes them difficult to operate on commercial hardware, and they require substantial time for inference, leading to high costs in a production setting. In this study, we tackle these obstacles by employing LLMs to develop three distinct models for paraphrasing, applying a method referred to as sequence-level knowledge distillation. These distilled models maintain the quality of paraphrases generated by the LLM, demonstrate faster inference times, and generate diverse paraphrases of comparable quality. A notable characteristic of these models is their ability to exhibit syntactic diversity while also preserving lexical diversity, features previously uncommon due to data quality issues in existing datasets and not typically observed in neural-based approaches. Human evaluation of our models shows only a 4% drop in performance compared to the LLM teacher model used in the distillation process, despite being 1000 times smaller. This research provides a significant contribution to the NLG field, offering a more efficient and cost-effective solution for paraphrasing tasks.
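Sequence-level knowledge distillation, as described, amounts to training the student on the teacher's generated outputs rather than on the original references. A minimal sketch of the data-construction step; the `teacher_generate` interface and `distill_dataset` helper are illustrative assumptions, not the paper's actual code:

```python
def distill_dataset(sources, teacher_generate, n_per_source=3):
    # Sequence-level KD data step: the teacher's sampled paraphrases
    # become the student's training targets. `teacher_generate` is a
    # hypothetical callable (sentence, n) -> list of n paraphrases.
    pairs = []
    for src in sources:
        for para in teacher_generate(src, n_per_source):
            pairs.append((src, para))
    return pairs

# Toy stand-in teacher, for illustration only.
def toy_teacher(sentence, n):
    return [f"{sentence} (variant {i})" for i in range(n)]

pairs = distill_dataset(["the cat sat"], toy_teacher, n_per_source=2)
```

A small student model fit on such (source, teacher paraphrase) pairs with ordinary sequence-to-sequence training then inherits the teacher's output distribution at a fraction of the inference cost.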


Task-Oriented Paraphrase Analytics

Gohsen, Marcel, Hagen, Matthias, Potthast, Martin, Stein, Benno

arXiv.org Artificial Intelligence

Since paraphrasing is an ill-defined task, the term "paraphrasing" covers text transformation tasks with different characteristics. Consequently, existing paraphrasing studies have applied quite different (explicit and implicit) criteria as to when a pair of texts is to be considered a paraphrase, all of which amount to postulating a certain level of semantic or lexical similarity. In this paper, we conduct a literature review and propose a taxonomy to organize the 25 identified paraphrasing (sub-)tasks. Using classifiers trained to identify the tasks that a given paraphrasing instance fits, we find that the distributions of task-specific instances in the known paraphrase corpora vary substantially. This means that the use of these corpora, without the respective paraphrase conditions being clearly defined (which is the normal case), must lead to incomparable and misleading results.


Neural Machine Translation for Malayalam Paraphrase Generation

Varghese, Christeena, Koshelev, Sergey, Yamshchikov, Ivan P.

arXiv.org Artificial Intelligence

This study explores four methods of generating paraphrases in Malayalam, utilizing resources available for English paraphrasing and pre-trained Neural Machine Translation (NMT) models. We evaluate the resulting paraphrases using both automated metrics, such as BLEU, METEOR, and cosine similarity, and human annotation. Our findings suggest that automated evaluation measures may not be fully appropriate for Malayalam, as they do not consistently align with human judgment. This discrepancy underscores the need for more nuanced paraphrase evaluation approaches, especially for highly agglutinative languages.
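Of the automated metrics mentioned, cosine similarity is the simplest to sketch. A crude surface-level variant over token counts is shown below (practical setups typically compute cosine similarity over sentence embeddings instead, which is an assumption about this study's exact setup we do not make here):

```python
import math
from collections import Counter

def cosine_sim(a, b):
    # Cosine similarity of token-count vectors: a surface-level stand-in
    # for embedding-based cosine similarity.
    ca, cb = Counter(a.split()), Counter(b.split())
    dot = sum(ca[t] * cb[t] for t in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0
```

For agglutinative languages like Malayalam, token-level overlap is exactly where such metrics struggle, since a single inflected word can correspond to several English tokens.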


Paraphrase Types for Generation and Detection

Wahle, Jan Philip, Gipp, Bela, Ruas, Terry

arXiv.org Artificial Intelligence

Current approaches in paraphrase generation and detection heavily rely on a single general similarity score, ignoring the intricate linguistic properties of language. This paper introduces two new tasks to address this shortcoming by considering paraphrase types - specific linguistic perturbations at particular text positions. We name these tasks Paraphrase Type Generation and Paraphrase Type Detection. Our results suggest that while current techniques perform well in a binary classification scenario, i.e., paraphrased or not, the inclusion of fine-grained paraphrase types poses a significant challenge. While most approaches are good at generating and detecting generally semantically similar content, they fail to understand the intrinsic linguistic variables they manipulate. Models trained to generate and identify paraphrase types also show improvements on tasks without them. In addition, scaling these models further improves their ability to understand paraphrase types. We believe paraphrase types can unlock a new paradigm for developing paraphrase models and solving tasks in the future.


Revisiting the Evaluation Metrics of Paraphrase Generation

Shen, Lingfeng, Jiang, Haiyun, Liu, Lemao, Shi, Shuming

arXiv.org Artificial Intelligence

Paraphrase generation is an important NLP task that has achieved significant progress recently. However, one crucial problem is overlooked: how to evaluate the quality of a paraphrase? Most existing paraphrase generation models use reference-based metrics (e.g., BLEU) from neural machine translation (NMT) to evaluate their generated paraphrases. Such metrics' reliability has hardly been evaluated, and they are only plausible when a standard reference exists. Therefore, this paper first answers one fundamental question: are existing metrics reliable for paraphrase generation? We present two conclusions that contradict conventional wisdom in paraphrase generation: (1) existing metrics poorly align with human annotation in system-level and segment-level paraphrase evaluation; (2) reference-free metrics outperform reference-based metrics, indicating that standard references are unnecessary for evaluating paraphrase quality. Such empirical findings expose a lack of reliable automatic evaluation metrics. Therefore, this paper proposes BBScore, a reference-free metric that reflects the generated paraphrase's quality. BBScore consists of two sub-metrics: the S3C score and SelfBLEU, which correspond to two criteria for paraphrase evaluation: semantic preservation and diversity. By combining the two sub-metrics, BBScore significantly outperforms existing paraphrase evaluation metrics.
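The SelfBLEU sub-metric averages the BLEU of each generated paraphrase against the others; lower values indicate more diverse outputs. A simplified sketch using clipped unigram precision in place of full BLEU (the S3C semantic sub-metric is not reproduced here, and `bleu1` is a deliberate simplification, not the metric's exact definition):

```python
from collections import Counter

def bleu1(candidate, references):
    # Clipped unigram precision (BLEU-1 without brevity penalty):
    # a simplified stand-in for the BLEU used inside SelfBLEU.
    cand = Counter(candidate.split())
    max_ref = Counter()
    for r in references:
        for t, n in Counter(r.split()).items():
            max_ref[t] = max(max_ref[t], n)
    clipped = sum(min(n, max_ref[t]) for t, n in cand.items())
    total = sum(cand.values())
    return clipped / total if total else 0.0

def self_bleu(outputs):
    # Average BLEU of each output against all the others:
    # 1.0 means identical outputs, lower means more diverse.
    if len(outputs) < 2:
        return 0.0
    scores = [bleu1(o, outputs[:i] + outputs[i + 1:])
              for i, o in enumerate(outputs)]
    return sum(scores) / len(scores)
```

For example, `self_bleu` over three identical paraphrases is 1.0, while two paraphrases with no shared tokens score 0.0.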


Novelty Controlled Paraphrase Generation with Retrieval Augmented Conditional Prompt Tuning

Chowdhury, Jishnu Ray, Zhuang, Yong, Wang, Shuyi

arXiv.org Artificial Intelligence

Paraphrase generation is a fundamental and long-standing task in natural language processing. In this paper, we concentrate on two contributions to the task: (1) we propose Retrieval Augmented Prompt Tuning (RAPT) as a parameter-efficient method to adapt large pre-trained language models for paraphrase generation; (2) we propose Novelty Conditioned RAPT (NC-RAPT) as a simple model-agnostic method of using specialized prompt tokens for controlled paraphrase generation with varying levels of lexical novelty. By conducting extensive experiments on four datasets, we demonstrate the effectiveness of the proposed approaches for retaining the semantic content of the original text while inducing lexical novelty in the generation.